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Abstract 

The  Fermi-Dirac,  Bose-Einstein  and,  for  completeness  the  Maxwell- 
Boltzmann,  distributions  are  obtained  respectively  from  considerations  of 
binomial,  negative  binomial,  and  Poisson  assemblies.  The  method  used  has  the 
simplicity  of  the  traditional  derivations  that  are  based  on  combinatorial 
considerations  but  involves  neither  the  identification  of  most  probable  values 
with  mean  values  nor  the  invocation  of  large  numbers  of  particles  involved  in 
the  use  of  Stirling’s  approximation  for  factorials.  The  method  thereby  also 
relaxes  the  requirement  for  large  numbers  of  particles  needed  in  other  avail¬ 
able  derivations  that  also  use  mean  values,  namely,  the  Darwin-Fowler  method 
of  steepest  descents  and  the  Khinchin  method  that  employs  limit  theorems  of 
the  theory  of  probability. 
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Introduction 


T 

i 

! 

Although  quantum  statistics  have  been  consistently  verified  enq>irically 
in  the  approximately  sixty  years  since  their  introduction,  there  remain  dis¬ 
quieting  aspects  about  the  way  in  which  Fermi-Dirac  and  Bose-Einstein  occupa¬ 
tion  factors  are  derived.  The  most  straightfoward  way^  to  derive  these  occu¬ 
pation  factors  uses  combinatorial  methods  to  enumerate  distinguishable 
arrangements  for  indistinguishable  particles  described  respectively  by  anti¬ 
symmetric  wave  functions  for  Fermi-Dirac  statistics,  and  symmetric  wave  func¬ 
tions  for  Bose-Einstein  statistics.  With  the  enumeration  in  hand  and  fac¬ 
torials  expressed  in  the  Stirling's  approximation,  an  extremum  principle 
subject  to  appropriate  constraints  on  total  number  of  particles  and  total 
energy  is  used  to  yield  the  most  probable  distribv^ 'on  of  particles.  This 
most  probable  distribution  is  considered  equivalent  to  the  mean  value  dis¬ 
tribution  which  is  in  turn  directly  related  to  the  desired  occupation  factor. 

Although  the  difference  between  the  most  probable  distribution  and  the 
mean  value  distribution  is  not  great,  dissatisfaction  with  the  use  of  this 
mathematical  approximation  in  the  derivation  of  the  fundamental  quantiun  sta¬ 
tistical  distributions  has  led  to  alternative,  more  complicated  but  rigorous, 

derivations  based  on  mean  values.  These  alternative  derivations  include  the 
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Darwin-Fowler  method  of  steepest  descents,  and  Khinchin's  use  of  limit 

3 

theorems  of  the  theory  of  probability.  However,  even  these  derivations  share 
with  the  combinatorial  approach  the  requirement  that  the  total  number  of 
particles  under  consideration  be  very  large.  Since  quantum  statistics  applies 
even  for  small  total  number  of  particles,  it  is  desirable  to  develop  a  deriva¬ 
tion  that  both  uses  mean  values  and  relaxes  the  requirement  that  the  total 
number  of  particles  under  consideration  be  very  large.  This  is  indeed  the 
object  of  the  present  paper. 


The  new  approach  used  here  has  the  simplicity  of  the  combinatorial  method 

but  deals  directly  with  mean  values  and  does  not  require  the  approximation  of 

a  large  total  number  of  particles.  The  starting  point  is  the  use  of  Bernoulli 

trials  and  related  probability  distributions,  namely  the  binomial,  negative 

binomial,  and  Poisson  distributions.  Assemblies  of  such  distributions  are 

used  respectively  to  derive  the  Ferroi-Dirac  (sec.  II),  Bose-Einstein 

(sec.  Ill),  and  Maxwell-Boltzmann  (sec.  IV)  occupation  factors  by  application 

of  an  entropy  extremum  principle.  The  entropy  extremum  principle  is  closely 

4 

related  to  one  recently  used  by  Grandy  to  derive  the  various  occupation 
factors.  An  important  distinction  between  Grandy 's  approach  and  the  present 
one  is  the  basis  for  the  definition  of  entropy. 

The  present  approach  shares  with  its  predecessors  an  apparent  requirement 
that  the  particles  are  non-interacting,  i.e.,  that  many  particle  quantities 
are  built  up  from  one  particle  entities.  This  seems  paradoxical  since  the 
relative  particle  occupancies  should  be  based  on  some  sort  of  coupling  among 
the  particles  either  directly  or  through  a  thermal  reservoir.  In  fact  the 
assumption  of  non-interacting  particles  may  be  understood  to  mean  that  a 
one-particle  basis  is  valid  at  thermal  equilibrium  in  that  all  interactions 
are  subsumed  into  the  manner  in  which  constraint  conditions  are  introduced. 
From  that  viewpoint,  the  assumption  of  non-interacting  particles  is  appropri¬ 
ate  for  the  derivation  of  occupation  factors  for  particles. 

After  the  various  occupation  probabilities  are  derived  in  the  next  three 
sections,  concluding  remarks  are  made  in  section  V. 

II.  Binomial  Assemblies  and  Fermi-Dirac  Statistics 

A  binomial  assembly  consists  of  all  possible  binomial  distributions 
generated  in  the  expansion  of 

(p+q)**  =  1  (2.1) 
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where  R  is  the  number  of  trials,  p  is  the  probability  of  success  in  a  trial, 
and  q=l-p  is  the  probability  of  failure.  A  typical  term,  say  the  Nth  term 
provides  the  binomial  probability  (frequency)  function  for  N  successes  in  R 
trials. 

i 

Fr(N,p)  =  (J)  pV”  (2.2) 

I  . 

I  ’ 

The  mean  value  for  the  number  of  successful  trials  is 

! 

] 

<N>  =  Jq  N  Fj,(N,p)  =  p  (p+q)*^  =  Rp  (2.3)  j 

If  the  number  of  trials,  R,  is  identified  as  the  number  of  possible  occupa¬ 
tions  or  cells,  and  the  mean  number  of  successes,  <N>,  is  identified  as  the 
mean  number  of  occupations,  then  p  may  be  identified  as  the  occupation  factor. 

In  this  way  the  language  of  the  binomial  distribution  is  translated  into  the 
language  of  particle  statistics. 

First  observe  that  indeed  the  combinatorial  factor  on  the  right  hand  side 
of  £q.  (2.2),  („),  provides  the  number  of  distinguishable  ways  of  achieving  N 
occupations  in  R  cells  when  the  exclusion  principle  (the  occupation  number  for 
individual  cells  may  be  0  or  1)  applies.  Thus  contact  is  immediately  estab¬ 
lished  with  the  concepts  usually  involved  in  Fermi -Dirac  statistics.  The 
combinatorial  factor  will  be  designed  the  weight  function 

^  ~  N!  (R-N)! 

The  remaining  factor  in  the  right  hand  side  of  Eq.  (2.2)  will  be  designated 
the  load  function. 

Ap(N,p)  £  p**q**  **  (2.5) 


Equation  (2.2)  then  takes  the  form, 
FpCH.p)  =  W^(N)  A^(N,p) 
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(2.6) 


It  is  important  to  note  that  the  mean  number  of  occupations  <N>  is  com¬ 
puted  with  the  help  of  the  complete  Fjj(N,p).  Given  this  <N>  and  Eq,  (2.3), 
£nAj^(<N>,p)  may  be  expressed  in  the  form 

£nAjj(<N>,p)  =  Rlp£np+(l-p)£n(l-p)]  =  -RS(p)/k  (2,7) 

Here  S(p)=-k(p£np+(l-p)£n(l-p)]  is  the  Shannon  entropy^*^  of  the  occupation 
factor  p,  where  k  is  the  Boltzmann  constant. 

Consider  now  a  set  of  J  binomial  assemblies  each  labelled  by  an  index  j 
corresponding  to  physical  levels.  The  assemblies  will  be  considered  non¬ 
interacting  so  that  any  coupling  that  influences  their  relative  occupations  is 
subsumed  into  subsidiary  constraint  conditions.  Then  the  possible  distribu¬ 
tions  for  the  set  are  generated  by  the  product  of  all  possible  distributions 
for  the  individual  assemblies: 

jdj  {p(j)  +  q(j)}’^^-^^  =  1  .  (2.8) 

A  typical  distribution  may  be  characterized  by  a  set  of  numbers  of  occupations 
within  the  individual  assemblies 

N  =  IN(1),  N(2),  ....  N(J)]  (2.9) 

for  the  complete  set  of  occupation  possibilities  (cells), 

R  =  (R(l),  R(2),  ...,  R(J)]  .  (2.10) 

The  probability  for  N  occupations  in  R  cells  is  then 

E)  =  jil  WR(j)(N(j))\(j)(N(j),p(j))  (2.11) 

where 

E  =  Ip(l).p(2) . P(J)]  (2.12) 

Equation  (2.11)  may  be  written  in  the  form 

Fj,(N,£)  =  W^(N)Aj^(N(N,£)  ,  (2.13) 
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where 


and 


“r'S)  >  ji,  «B(J)WO» 

VS'E>  *  jii • 


(2.1A) 
(2. IS) 


In  the  traditional  combinatorial  treatment^  of  Fermi-Dirac  statistics, 
£nWj^(N)  is  expressed  in  Stirling’s  approximation  and,  subject  to  subsidiary 
conditions,  maximized  with  respect  to  N(j).  The  maximization  is  carried  out 
by  the  method  of  Lagrange  undetermined  multipliers  into  which  the  particles' 
interaction  with  a  heat  reservoir  are  subsumed.  The  eventual  result  is  the 
most  probable  value  of  N(j).  No  attention  is  paid  to  the  load  function  in 
this  traditional  approach.  £nWj^(N)  is  related  to  the  entropy  so  that  this 
procedure  is  an  entropy  maximum  principle. 

The  emphasis  in  the  present  treatment  is  on  the  mean  number  of  occupa¬ 
tions,  i.e.,  <N(j)>  or  equivalently,  by  Eq.  (2.3),  the  occupation  factors 
p(j).  A  traditional  minimization  principle  is  applied  to  the  function 
£nAj^(<N>,£)  subject  to  subsidiary  conditions,  by  variation  with  respect  to 
p(j)-  Note  that 

£aAjj(<N>,£)  =  jijR(j){p(j)£np(j)+fl-p(j)j£n[l-p(j)]} 

s  -  R(j)  S(p(j))/k  .  (2.16) 

Here  S(p(j)),  the  Shannon  entropy  associated  with  p(j),  is  a  function  only  of 
the  p(j)  since  the  R(j)'s  are  fixed. 

The  subsidiary  conditions  here  consist  of  the  requirements  that  the  sum 
of  the  mean  numbers  of  occupations, 

<H>  S  jij  <N(j)>  =  R(j)p(j)  ,  (2.17) 

and  the  sum  of  the  mean  number  of  occupations  multiplied  by  the  level  energy 
of  the  respective  assembly. 


<E>  =  <H(j)>  E(j)  =  ji,  E(j)R(j)p(j)  , 


(2.18) 
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and  both  constants.  It  is  straightforward  then  to  extremize 

F  s  {£nAj^(<N>,£)  +  o<N>  +  P<E>}  ,  (2.19) 

where  o,P  are  Lagrange  multipliers  which  turn  out  to  have  their  usual  mean¬ 
ings,  namely,  p=l/kT  and  a=-pp  with  p  the  chemical  potential  and  T  the  abso¬ 
lute  temperature.  The  extremum  condition  6F/6p(j)=0  leads  to  the  extremum 
occupation  factor  which  will  be  denoted  by  f(j). 

f(j)  =  (l+expla+pE(j)])'^  (2.20) 

This  is  of  course  the  Fermi-Dirac  occupation  factor.  Further  it  should  be 
noted  that 

(6V6p(j)^)p^jj^^^jj  =  R(j)e“'"P^^j^e“*P^^j^+l)^>0  (2.21) 

so  that  the  Fermi-Dirac  occupation  factor  minimizes  F. 

From  Eq.  (2.16),  it  is  clear  that  the  minimization  principle  for  F  is 
related  to  a  maximization  of  the  Shannon  entropy  of  the  underlying  collection 
of  binomial  assemblies.  This  relationship  will  be  discussed  further  in  sec¬ 
tion  V  below. 

It  should  also  be  pointed  out  that  while  the  weight  functions  were  not 
involved  in  the  extremum  principle,  the  entire  probability  functions  were 
employed  in  establishing  the  relation  between  mean  value  numbers  and  occupa¬ 
tion  factors.  Thus  all  the  quantities  that  enter  into  the  binomial  assem¬ 
blies  enter  into  the  present  approach. 

As  a  final  observation  in  this  section,  it  may  be  noted  that  passage  to 
continuous  energy  spectrum  is  straightforward.  All  that  is  required  is  to 
interpret  R(j)  as  the  number  of  states  in  the  sheet  between  energies  E(j)  and 
E(j)+dE(j). 
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III.  Negative  Binomial  Assemblies  and  Bose-Einstein  Statistics 

The  Bose-Einstein  occupation  factor  may  be  derived  from  negative  binomial 
assemblies  in  a  manner  similar  to  the  one  used  for  Fermi-Dirac  statistics  in 
the  previous  section.  However,  the  derivation  does  introduce  an  interesting 
subtlety  in  the  definition  of  the  occupation  factor. 

A  negative  binomial  assembly  consists  of  all  possible  distribution  gen¬ 
erated  in  the  expansion  of 

P^(l-q)‘^  =  1  (3.1) 

where  R  is  the  number  of  successful  trials.  A  typical  term  in  the  expansion, 
say  the  Nth  term,  provides  that  probability  function  that  there  will  be  R 
successes  at  the  (R->'N))th  trial 

T*  /M  ^  .R'^N— Iv  R  N  o  o  ^ 

Fjj  (N,p)  =  (  j,  )  P  q  .  (3.2) 

The  mean  value  of  the  number  of  trials  beyond  R  necessary  to  achieve  R  suc¬ 
cesses  is 

<N>  =  R(q/p)  (3.3) 

<N>  is  just  the  mean  number  of  failures  that  occur  along  with  R  successes. 
Although  the  range  of  N  in  taking  the  mean  value  here  is  this  does  not 

represent  a  requirement  that  N  be  large,  only  that  it  may  be  large.  The 
actual  specification  of  particle  number  is  introduced  through  a  subsidiary 
constraint  condition. 

Again  if  R  is  identified  as  the  number  of  cells,  then  <N>  may  be  viewed 
as  the  mean  number  of  occupations  in  the  R  cells.  The  order  of  the  failures 
is  imaaterial  and  their  number  is  not  limited  so  that  their  properties  coin¬ 
cide  with  Bose-Einstein  occupations.  However  in  this  case 


must  be  ideatified  as  the  occupation  factor.  Note  that  0<p<*  since  0<p<l. 
Equation  (3.2)  is  readily  rewritten  in  terms  of  p,  namely 


p  fv  rR+N-K  ,  1  .R+N  -  N 

Fr(N,P)  =  (  jj  )  (  —  )  P 

1+p 


(3.5) 


The  combinatorial  factor  on  the  right  hand  side  is  the  one  that  enters  in  the 
traditional  treatment  of  Bose-Einstein  statistics  but  is  here  the  weight 
function. 


W^(N) 


(Ri-N-l)! 

(R~1)!N! 


The  remaining  factor  is  the  corresponding  load  function 


(3.6) 


Ap(N,p)  =  (  -  )  p 

1+p 


(3.7) 


Equations  (3.3)  and  (3.4)  may  be  used  to  express  £nAj^(<N>,p)  in  terms  of  S(p), 
the  analog  of  the  Shannon  entropy  of  the  occupation  factor  p.  S(p)  is  only 
an  analog  since  p  is  an  occupation  factor  rather  than  a  probability. 


AnAj^(<N>,p)  =  R[p£np-(l+p)£n(l+p])H-RS(p) 


(3.8) 


In  a  manner  completely  analogous  to  that  of  the  previous  section,  con¬ 
sideration  of  a  set  of  negative  binomial  assemblies  leads  to  the  probability 
of  N  occupations  in  R  cells 


Fr(N.E)  =  Wp(N)A^(N,E)  (3.9) 

Here 

-  jii  ^R(j)^**^j^^ 

The  procedure  is  here  to  extremize 

F  H  {£iu^(<N>,E)  +  o<N>  ♦  P<E>}  (3.12) 
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where  the  constraints  Eqs.  (2.17)  and  (2.18)  here  have  the  same  form  but  with 
p(j)  replacing  p(j).  The  extremum  occupation  factor  which  will  be  denoted  by 
^>(1)  is  just  the  Bose-Einstein  occupation  factor. 

b(j)  =  fexp(of+pE(j))-l  ]  ^  (3.13) 

Again  this  occupation  factor  corresponds  to  a  minimum  of  F  or  a  maximization 
of  the  Shannon  entropy  of  the  underlying  collection  of  negative  binomial 
assemblies.  Treatment  of  a  continuous  energy  spectrum  can  be  handled  in  the 
same  way  as  in  section  II. 


IV.  Poisson  Assemblies  and  Maxwell-Boltzroann  Statistics 

Maxwell-Boltzmann  statistics  can  be  obtained  as  a  high  energy  limit  of 
either  Fermi-Dirac  or  Bose-Einstein  statistics.  Alternatively  one  can  obtain 
the  Maxwell-Boltzmann  occupation  factor  by  considerations  completely  analogous 
to  those  of  the  previous  two  sections  for  the  Poisson  limit  of  binomial  (and 
negative  binomial)  assemblies.  Such  a  limit  holds  when  all  R(j)  are  large  and 
all  p(j)  (or  p(j))  are  small.  In  this  limit,  the  analogs  of  Eqs.  (2.14)  and 
(2.15)  (or  Eqs.  (3.10)  and  (3.11)  become 

«P(N)  =  (R<j)'*'''VN(j)l  (4.1) 

a''.CH)«)  =  .d,  (4.2) 

K  J-1 

where  m(j)  is  either  p(j)  or  p(j).  It  may  be  noted  that  such  weight  and  load 
functions  are  also  immediately  obtained  from  assembly  frequency  functions 
obtained  from  expansion  of  the  identity 


-Rffl  Rm 

e  e  = 


(4.3) 


In  any  event,  minimzation  of  £n/^(<N>,m)  subject  to  subsidiary  conditions 
Eqs.  (2.17)  and  (2.18)  with  m(j)  identified  as  the  occupation  factor  yields 
the  Maxwell-Boltzmann  occupation  factor. 


However,  it  may  be  observed  that  the  weight  function  in  Eq.  (A.l)  does 
not  coincide  exactly  with  the  usual  combinatorial  term  that  arises  in  the 
discussion  of  Maxwell -Boltzmann  statistics.  Namely  it  lacks  a  factor  of  N!, 
where  N  is  the  total  number  of  particles.  Such  a  factor  can  readily  be 
inserted  in  the  weight  function  if  its  inverse  is  inserted  in  the  load  func¬ 
tion.  The  important  point  is  that  such  a  factor  does  not  enter  into  the 
extremum  procedure  and,  in  this  sense,  is  a  gauge  factor  in  the  determination 
of  extremum  occupation  factors.  Such  a  gauge  factor  could  also  have  been 
introduced  in  the  assembly  collection  weight  and  load  functions  of  the  previ¬ 
ous  sections  without  changing  the  results  of  those  sections. 

The  fact  that  formalisms  used  to  describe  various  types  of  particle 
statistics  can  be  made  to  coincide  in  some  limit  does  not  mean  that  the  par¬ 
ticles  whose  statistics  are  being  described  coincide  themselves.  The  statis¬ 
tics  may  be  related  but  the  particles  retain  their  particular  symmetry  proper¬ 
ties  and  their  distinguishability  or  indistinguishability . 

V.  Concluding  Remarks 

The  present  derivation  of  Fermi-Dirac,  Bose-Einstein,  and  Maxwell- 
Boltzmann  occupation  factors  is  based  on  a  starting  point  in  which  both  occu¬ 
pation  possibilities  and  occupation  configurations  enter.  Occupation  possi¬ 
bilities  are  represented  by  the  weight  functions,  and  occupation  configura¬ 
tions  are  represented  by  the  load  functions.  The  logarithm  of  the  load  func¬ 
tion  has  been  shown  to  be  related  to  the  negative  of  the  Shannon  entropy  (or 
its  analog)  of  these  configurations.  Application  of  a  minimum  principle  to 
the  logarithm  of  the  load  functions  yields  the  usual  results  for  particle 
statistics  without  identification  of  most  probable  values  with  mean  values  or 
the  invocation  of  a  large  number  approximation  incorporated  in  Stirling's 
approximation.  The  present  approach  therefore  seems  to  be  more  sound  funda¬ 
mentally. 
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However,  the  fact  that  the  traditional  combinatorial  approaches  yield  the 
same  result  is  no  accident  but  instead  reflects  a  relationship  between  respec¬ 
tive  weight  and  load  functions.  The  logarithm  of  a  weight  function  provides  a 
combinatorial  entropy  whereas  the  logarithm  of  a  load  function  provides  the 
negative  of  a  configurational  Shannon  entropy.  Thus  an  extremum  principle  on 
one  is  the  complement  of  the  other.  Since  the  present  approach  employs  a 
minimum  principle  for  the  negative  of  the  Shannon  entropy  (or  its  analog), 
both  approaches  can  be  viewed  as  employing  maximum  entropy  principles.  How¬ 
ever,  it  may  be  more  appropriate  to  maintain  focus  on  the  negative  entropy 
aspect  and  a  minimum  principle.  Indeed  one  can  interpret  a  set  of  particles 
organized  with  a  specific  statistics  to  be  a  manifestation  of  local  order  that 
should  be  characterized  by  a  negative  entropy  contribution.  Overall  maximum 
disorder  or  entropy  would  then  be  achieved  by  a  minimization  of  the  negative 
entropy  contribution. 

Thus  the  present  approach  not  only  uses  mean  values  rather  than  most 
probable  values  and  relaxes  the  requirement  for  large  total  number  of  par¬ 
ticles,  but  it  also  allows  one  to  view  the  distribution  of  particles  according 
to  a  particular  occupation  statistics  to  be  an  ordering  process.  All  these 
features  seem  to  be  advantageous. 
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